(S> *T <A Errors Corrected by the STIC sJfcsms Branch 

S»rh,l Number: O Z V. V 9 8 W ' ^S T^ ^ ±=L 

| | Changed a file from non-ASCII to ASCII * Verified by: _ (STIC staff) 

| | Changed the margins in cases where the sequence text was \vrapped" down to the next line. 

["yf Edited a format enor in the Current Application Data section, specifically: £ N T EE F^t 5 D 

| 1 Edited the Current Application Data section with the actual current number. The number inputted by the 
applicant was □ the prior application data; or O other ' . 



L _J Added the mandatory heading and subheadings for "Current Application Data". 

| [ Edited the "Number of Sequences" field. The applicant spelled out a numberinstead of using an integer. 

| | Changed the spelling of a mandatory field (the headings or subheadings), specifically.: 

| | Corrected the SEQ ID NO when obviously incorrect. The sequence numbers that were edited were: 

| | Inserted or corrected a nucleic number at the end of a nucleic line. SEQ ID NO's edited: 

| | Corrected subheading placement. All responses must be on the same line as each subheading. If the 
applicant placed a response below the subheading, this was moved to its appropriate place. 

[ j Inserted colons after headings/subheadings. Headings edited included: 

| | Deleted extra, invalid, headings used by an applicant, specifically: 

j | Deleted: □ non-ASCII "garbage" at the beginning/end of files; □ secretary initials/filename at end of file; 

□ page numbers throughout text; □ other invalid text, such as . 

j | Inserted mandatory headings, specifically: 

| 1 Corrected an obvious error in the response, specifically: 



| | Edited identifiers where upper case is used but lower case is required, or vice versa. 

| [ Corrected an error in the Number of Sequences field, specifically: 

| | A "Hard Page Break" code was inserted by the applicant. All occurrences had to be deleted. 

| | Deleted ending stop codon in amino acid sequences and adjusted the "(A)Length:" field accordingly (error 
due to a Patentln bug). Sequences corrected: 



Other: 



<=>n 



Examiner: The above corrections must be communicated to the applicant in the first Office 
Actfon. DO NOT send a copy of this form. 3/1/95 
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C--> 
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31 
32 
33 
34 
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<110> APPLICANT: XIANG, JINHUA 

WUNSCHMANN, SABINA 

SCHMIDT, WARREN 

STAPLETON/ JACK T. 
<120> TITLE OF INVENTION: FULL-LENGTH GB VIRUS 

ARE INFECTIOUS IN PRIMARY CD4. POSITIVE T 
<130> FILE REFERENCE: IOWA:030US 

<140> CURRENT APPLICATION NUMBER: US/09/828,498 

<141> CURRENT FILING DATE: 2001-04-05 

<150> PRIOR APPLICATION NUMBER: 60/253,390 

<151> PRIOR FILING DATE: 2000-11-27 

<150> PRIOR APPLICATION NUMBER: 60/195,597 

<151> PRIOR FILING DATE: 2000-04-06 

<160> NUMBER OF SEQ ID NOS : 2 

<170> SOFTWARE: Patentln Ver. 2.1 

<210> SEQ ID NO: 1 

<211> LENGTH: 9395 

<212> TYPE: DNA 

<213> ORGANISM: Hepatitis G virus 
SEQUENCE: 1 

cccccccccg gcactgggtg 
actcggcgcc gactcggcga 
ggtcgtaaat cccggtcatc 
ctcctcttgt gcctgcggcg 



C (HEPATITIS G VIRUS) 
CELLS 



RNA TRANSCRIPTS 



<400> 

tgacgtgggg gggttgatcc 
ctatctaagt agacgcaatg 
gggtggtgac agggttggta 
gtcttaagag aaggtcaaga 

gtgctggccc taccggtgtg aataagggcc cgacgtcagg 



gtcacccacc tgggcaaacg 
tgaccaatag gtttatccgg 
aggaccccaa accctgccct 



caagccccat aaaccgacgc 
ccggccaaaa ggtggtggat 
ctggtagcca ctataggtgg 
agaccgcgca cggtccacag 
ctcgtcgtta aaccgagccc 
acgcccacgt acggtccacg tcgcccttca atgtctctct 



cggcggcctg 
ttttctatac 
tggccccggc 
ccccggaaga 
tttgcaccga 
ccgcggccca 
acgtagccgg 



cagccggggt 
catcatggca 
cacccacgct 
catcgggttc 
ccgttgctgg 
gctcgttggg 
gatcctgggt 



cgttgaggcg ccgggtctac 



cgagttgaca 
tcccggtggg 
agcccaagaa 
gtccttctgc 
tgtcgagcga 
tgcctggaag 
ccactgtatc 
gaactgggga 
ctgggcgagg 
ctgatgccca 



aggaccagtg 
ccgggaaatg 
tccttcgggt 
tccttctcgt 
atgggcaata 
gcggatgcct 
aggcgggttt 
gcctgtacgg 



ggggccgggg gttatgggga 
catggggcca cccagctccg 



gagggcgggt 
ggttgaggcc 
tttcctcaca 
ggtggccctg 
ggctgtgcgg 
gcccttgtcg 



tttactccgg ggtcctgaca 
acctgaagtg tgcagtagaa 



ggcatttctc 
ggggccattc 
aattgctgtg 
gggtgcacgg 
cctggcaagt 
gtctcggctt 
gttggtgttg 
tgtgacgtta 



agtggggaag tgagttttgg agatggactg agcagttggc ctccaattac tggattttgg 
aatacctttg gaaagtccca tttgaatttt ggagaggagt gatgagcctg acccctctgt 
tggtttgggt ggccgcattg cttttgctgg agcaacggat tgtcatggtt ttcctgctgg 



tgacgatggc ggggatgttg caaggcgccc ccgcctccgt 
actacgggtt gaagtggcag tcatgctcct gcagggctaa 
gggagagggt gtgggatcga gggaatgtca cgctcttgtg 
gggtttgggt cccggccttt tgccaggcgg ttgggtgggg 
gccacggaca aaaccagtgg cccctatcat gcccccaata 
taacgtgcgt gtggggttcc gtgtcttggt ttgcctcgac 
tcgatgtgtg gagtttggtg ccggttggat ctgccagctg caccatagcc gctctagggt 
catcggatcg cgacacggtg gttgagctct ccgagtgggg agtcccgtgc gtaacgtgta 



tttggggtcc cgcccctttg 
cgggtcgcgt attcccactg 
tgactgcccc aacggcccct 
cgaccccatc acccattgga 
tgtctatggg tctgtgtccg 
cggcggtcgt gattcgaaga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 
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59 ttctggaccg tcggcctgct tcatgtggca cctgtgtgcg ggactgctgg cccgaaaccg 1680 

60 ggtcggttag attccctttc catcggtgcg gcacggggcc tcggctgaca aaggacttgg 1740 

61 aagctgtgcc cttcgtcaac aggacaactc ccttcaccat aaggggcccc ctgggcaacc 1800 

62 aggggagagg caacccggtg cggtcgcccc tgggttttgg gtcctacacc atgaccaaga 1860 

63 tccgggattc cctgcatttg gtgaaatgtc ccacaccagc catagagcct ccgactggaa 1920 

64 cgttcgggtt cttccccgga gtcccgccca ttaacaactg catgccgcta ggcacggaag 1980 

65 tgtctgaggc attgggcgga gctgggctta cgggggggtt ctacgagcct ctggttcgca 2040 

66 ggtgttcgga gctgatggga cgccgaaatc cggtttgccc ggggtacgca tggctgtcct 2100 

67 ctggtagacc tgacgggttc atacacgtcc aggggcacct gcaggaggtg gatgcgggca 2160 

68 acttcatccc tcctccacgc tggttgctct tggattttgt atttgtcctg ctctatctga 2220 

69 tgaagctggc tgaggcacgg ttggtcccgt tgatcttgct tctgctgtgg tggtgggtga 2280 

70 accagttggc ggttctagga ctgccggctg tggacgctgc cgtggcgggt gaagtttttg 2340 

71 cgggccctgc cttgtcatgg tgtttgggcc ttcccactgt cagtatgata ctaggtctag 2400 

72 caaacctggt gttgtacttt cggtggatgg gccctcagcg cctcatgttc ctcgtgttgt 2460 

73 ggaagctcgc tcggggagct ttcccgctgg cacttttgat ggggatttcg gcgacccgcg 2 520 

74 ggcgcacctc tgtgctcggg gccgagttct gcttcgatgt cacattcgag gtggacactt 2580 

75 cggtgttggg ctgggtggtg gccagcgtgg tggcttgggc catagcgctc ctgagctcaa 2 64 0 

76 tgagcgcagg ggggtggaag cacaaggccg tgatctatag gacgtggtgt aaagggtacc 2700 

77 aggctgtgcg ccagagggtg gtgcggagcc ccctcgggga ggggcgtcct accaagcttc 2760 

78 tgacgttcgc ctggtgcttg gcctcataca tctggccgga tgctgtgatg atggtggtgg 2820 

79 tggccttggt cctcctcttc ggcctgttcg acgcactgga ctgggccctg gaggagctcc 2880 

80 tggtctcccg gccctcgtta cggcgactgg cacgggtggt tgagtgctgt gtgatggcgg 294 0 

81 gcgagaaggc caccaccatc cgactggtct ccaagatgtg cgcaagaggg gcctacctgt 3000 

82 ttgaccacat gggctctttc tcgcgcgctg tcaaggagcg cttgttggaa tgggacgcgg 3060 

83 ctttggagcc cttgtcattc actaggacgg actgtcgcat catcagagat gccgcgagga 3120 

84 ccctgtcctg cggacagtgc gtcatgggtt tacccgtggt agcacggcgc ggtgatgagg 3180 

85 ttctcatcgg cgtctttcag gatgtgaatc atttgcctcc cgggtttgtc ccgactgcac 3 240 

86 cagttgtcat . ccgtcggtgc ggaaagggct tcctgggggt cacgaaggca gccttgacag 3300 

87 gtagggatcc tgacttacat ccagggaacg tcatggtgtt ggggacggct acgtcacgaa 3360 

88 gcatgggcac atgtctgaat ggcctgctgt tcacaacttt ccatggggct tcatcccgaa 34 20 

89 ccatcgccac gcccgtgggg gcccttaatc ccaggtggtg gtcagccagt gatgacgtca 34 80 

90 cggtgtaccc gcttccagat ggggcaactt cgttgacgcc ctgcacttgc caggcggagt 3 54 0 

91 cctgttgggt tattagatcc gacggggctt tgtgccatgg cttgagcaag ggggacaagg 3600 

92 ttgagctgga tgtggccatg gaggtctctg acttccgtgg ttcgtctggt tcaccggtcc 3 660 

93 tttgcgacaa agggcacgca gtaagaatgc tcgtgtcagt gctccactct ggcggcaggg 3720 

94 ttactgcggc gcgattcact aggccgtgga ctcaagtacc aacagatgcc aagactacca 3780 

95 cagaaccccc tccggtgccg gcaaaaggag ttttcaagga ggccccgttg tttatgccta 3 840 

96 cgggggcggg aaagagcacc cgcgtaccgt tggagtacgg caacatgggc cacaaggtct 3900 

97 tgatcttgaa cccgtcggta gctaccgtga gggccatggg cccatacatg gagcggctgg 3960 

98 cggggaaaca ccccagtatt tactgtggcc atgacaccac tgctttcaca aggatcactg 4020 

99 actcgcccct tacgtattcc acttacggaa ggtttttggc caaccctagg cagatgctga 4 080 

100 ggggtgtgtc ggtggtcatt tgtgacgagt gccacagtca tgactcaact gtgttgttgg 4140 

101 gcattgggcg tgtcagggag ctggcgcgag gatgtggagt gcaattggtg ctctacgcca 4200 

102 ctgccacccc tcccggatcc ccgatgaccc agcacccatc aatcattgag acaaaactgg 4260 

103 acgtgggaga gatccccttc tatgggcatg gcatacctct tgagcggatg cggaccggaa 4 320 

104 ggcatctcgt attctgccac tccaaggctg agtgcgagcg cctggcgggc cagttttcgg 4 380 

105 ctaggggggt aaatgccatc gcctattaca gggggaaaga cagttctatc atcaaagatg 4440 

106 gagacctggt ggtgtgtgct acagacgcac tatccactgg gtacactggg aacttcgatt 4 500 

107 ctgtcaccga ttgtgggtta gtggtggagg aggtcgtcga ggtgaccctt gatcccacca 4 560 
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108 ttaccatctc cctgcgcacg gtgcccgcgt cggctgaact gtcgatgcag cggcgaggac 4620 

109 gcacgggtag gggcaggtct gggcgctact actacgcggg ggtcggcaag gcccctgctg 4680 

110 gtgtggtgcg ctcaggtcct gtctggtcgg cggtggaagc cggtgtgacc tggtacggaa 474 0 

111 tggaacctga cctgacagca aacctactga gactttacga caactgccct tacaccgcag 4 800 

112 ccgtcgcagc tgacattggg gaagccgcgg tgttcttttc ggggcttgcc ccgttgagga 4 860 

113 tgcatcccga tgttagctgg gcaaaagttc gcggcgtcaa ctggcccttc ctggtgggtg 4 920 

114 ttcagcggac catgtgccgg gaaacactgt ctcccggccc atcggatgac ccccagtggg 4980 

115 caggtctgaa gggcccgaat cctgtcccac tcctgctgag gtggggcaat gatttaccat 5040 

116 ctaaagtggc cggccatcac atcgtggacg acctggtccg taggctcggg gtggcggagg 5100 

117 gttacgtccg ctgcgatgcg ggacccatct tgatggtggg cctcgctatt gcggggggca 5160 

118 tgatctatgc gtcatacacc gggtctctcg tggtggttac agactgggat gtgaaggggg 5220 

119 gtggcagccc cctttatcgg catggagacc aggccacgcc ccagccggtt gtgcaggtcc 5280 

120 ccccggtaga ccatcggccg gggggagagt ctgcgccatc ggatgccaac acagtgacag 5340 

121 atgcggtggc ggccatccag gtggattgcg attggtcagt catgaccctg tcgatcgggg 5400 

122 aagtgctgtc cttggcccag gctaagacgg ccgagg'ccta cgcagctacc accaagtggc 5460 

123 ttgctggctg ctacacgggg .acgcgggccg tccccactgt ttcaattgtt gacaagctct 5520 

124 tcgccggggg ctgggcggcg gtggtaggcc attgccacag tgtaatagct gcggcagtgg • 5580 

125 cggcctatgg ggcttctagg agccctccat tggctgctgc cgcttcctac ctcatggggt 564 0 

126 tgggcgtcgg aggcaacgcg caaacccgct tagcctccgc tctcctacta ggggccgctg 5700 

127 ggaccgctct gggcacgcct gtcgtggggt taaccatggc gggcgcgttc atgggaagtg 5760 

128 ctagcgtctc cccctccttg gtcaccattt tactgggggc cgtggggggc tgggagggcg 5820 

129 tggtgaatgc ggctagcctt gtcttcgact ttatggcggg gaaactatca tcagaagatc 5880 

130 tgtggtatgc catcccagtg ctaaccagtc cgggggcagg acttgcgggg atcgccctcg 5940 

131 ggttggtgtt gtactcagct aacaactctg gcactaccac ttggttgaac cgtctgctga 6000 

132 ctacattgcc aaggtcctca tgcatccctg acagttactt tcagcaggcc gattactgtg 6060 

133 acaaggtctc agctgtgctc cgacgcttga gcctcactcg caccgtggtt gccctggtca 6120 

134 acagggagcc taaggtggat gaggttcagg tggggtacgt ctgggacttg tgggagtgga 6180 

13 5 tcatgcgtca agtgcgcatg gtgatggcca gacttcgggc cctctgcccc gtggtgtcat 624 0 

136 tacccttatg gcactgcggg gaggggtggt ccggagaatg gttgttggac ggccatgttg 6300 

137 agagtcgttg tctttgtggt tgcgtgatca ccggtgatgt tttgaatggg caactcaaag 6360 

138 atccagttta ctctaccaag ctgtgcaggc attattggat ggggacagtc cctgtgaaca 6420 

139 tgctgggcta tggcgagacg tcgcctttgc tcgcctcaga caccccgaag gtggtaccat 64 80 

140 tcgggacgtc tgggtgggct gaggtggtgg tgacccctac ccacgttgtg atcaggcgaa 654 0 

141 catccgccta caaactgctg cgccagcaaa tcctgtcggc tgctgttgct gagccctatt 6600 

14 2 acgtcgacgg cataccggtc tcatgggacg cggacgcgcg agcgcctgcc atggtctatg 6660 
14 3 gccctgggca aagtgtcacc attgacgggg aacgctacac ccttccgcat caactgcggc 6720 

144 ttaggaatgt ggcgccctct gaggtgtcat ccgaggtgtc cattgacatt gggacggaga 67 80 

145 ctgaagactc agaactgact gaggccgacc tgccgccggc ggctgcagcc cttcaggcta 6840 

146 tcgagaatgc tgcgagaatt cttgaacctc acatagatgt catcatggaa gattgcagta 6900 

147 caccctctct ttgtgggagt agccgagaga tgcctgtgtg gggagaagac ataccccgca 6960 
14 8 ctccatcgcc agcacttatc tcggttactg agagcagccc agatgagaag accccgtcgg 7020 
14 9 tgtcttcctc gcaggaggat accccgtctt ctgactcatt cgaggtcatc caagagtccg 7080 

150 agacagccga aggggaggaa agcgtcttca acgtggctct ttccgtacta aaagccttgt 7140 

151 ttccacagag cgatgccaca agaaagctta ccgttaagat gtcatgctgt gttgagaaga 7200 

152 gcgtaacacg cttcttttca ttgggattga cggtcgctga cgtggcaagc ctgtgtgaga 7260 

153 tggaaatcca gaaccataca gcctattgtg acaaggtgcg cactccgctt gaattgcagg 7320 

154 ttgggtgctt ggtgggcaat gaacttacct ttgaatgtga caagtgtgag gctaggcaag 7380 

155 agaccttggc ttccttctct tacatttggt ctggggtgcc actgacgagg gccactccgg 7440 

156 ccaagccccc tgtggtgagg ccggttggct ccttgctggt ggccgacacc accaaggtgt 7500 
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157 atgtcaccaa cccggacaat gttgggagaa gagttgacaa ggttaccttc tggcgtgccc 7560 

158 ctagggttca tgacaaattc ctcgtggact ccatagagcg cgctaagagg gcagctcaag 7620 

159 cctgcctaag catgggttac acttatgagg aggcaataag gactgtaagg ccacatgctg 7680 

160 ccatgggctg gggatctaag gtgtcggtca aggacctcgc cacccctgcg gggaagatgg 7740 

161 ctgtccatga ccggctccag gagatacttg aagggacgcc agtccccttt actcttactg 7800 

162 tgaaaaagga agtgttcttc aaagaccgaa aggaagagaa ggccccccgc ctcattgtgt 7860 

163 tcccccccct ggacttccgg atagctgaaa agcttattct gggagaccct ggacgggtag 7920 

164 ccaaggcggt gttggggggg gcctacgcct tccagtacac cccaaatcag cgaattaggg 7980 

165 agatgctcaa actgtgggaa tcaaagaaga caccatgcgc catctgtgtg gacgccacat 8040 

166 gcttcgacag tagcataact gaagaggacg tggcgctgga gacagagctt tatgccctgg 8100 

167 cttcagacca tccagaatgg gtgcgtgccc tggggaaata ctatgcctct ggcacaatgg 8160 

168 taacccccga gggggtgcca gtgggtgaga ggtattgtag atcctcaggg gtcttgacca 8220 

169 ccagtgcgag caactgcttg acttgctata tcaaggtgaa agccgcctgt gagagggtgg 8280 

170 ggctgaaaaa tgtctcgctc ctcatcgctg gcgatgactg tttgatcata tgcgaacggc 8340 

171 ctgtgtgcga tcctagcgac gctttgggca gagccctggc gagctacggg tacgcatgcg 84 00 

172 agccttcgta tcatgcatca ctggacacgg cccccttctg ctccacttgg ctagctgagt 84 60 

173 gcaatgcaga tgggaaacgc catttcttcc tgaccacgga ctttcggagg cccctcgctc 8520 

174 gcatgtcgag cgagtacagt gacccaatgg cttcggccat cggttacatc ctcctatacc 8580 

175 cttggcatcc tatcacacgg tgggtcatca tccctcacgt gctcacctgc gcgtttaggg 8640 

176 gtggtggcac accgtctgat cctgtgtggt gccaggtaca tggtaattac tacaagtttc 8700 

177 cactggacaa actgcctaac atcatcgtgg ccctccacgg accagcagcg ttgagggtta 8760 

178 ccgcagacac aactaagaca aaaatggagg ctggcaaggt gctgagcgac ctcaagctcc 8820 

179 ctggcctagc agtccaccgg aagaaggccg gggcattgcg aacgcgtatg ctccggtcgc 8880 

180 gcggttgggc tgagttggct agggggctgt tgtggcgtcc aggcctgcgg cttccccctc 8940 

181 cggagattgc tggtatcccc gggggtttcc ccctttcccc cccctatatg ggggtggttc 9000 

182 atcaattgga tttcacaagc cagaggagtc gctggcggtg gttggggttc ttagccctgc 9060 

183 tcatcgtagc cctcttcggg tgaactaaat tcatctgttg cggcaaggtc cggtgactga 9120 

184 tcatcactgg aggaggttcc cgccctcccc gccccagggg tctccccgct gggtaaaaag 9180 

185 ggcccggcct tgggaggcat ggtggttact aaccccctgg cagggtcaaa gcctgatggt 9240 

186 gctaatgcac tgccacttcg gtggcgggtc gctaccttat agcgtaatcc gtgactacgg 9300 

187 gctgctcgca gagccctccc cggatggggc acagtgcact gtgatctgaa ggggtgcacc 9360 

188 ccggtaagag ctcggcccaa aggccgggtt ctact 9395 

191 <210> SEQ ID NO: 2 

192 <211> LENGTH: 2910 

193 <212> TYPE: PRT 

194 <213> ORGANISM: Hepatitis G virus 



196 


<400> SEQUENCE: 


2 
























197. 


Met 


Ser 


Leu 


Leu 


Thr 


Asn 


Arg 


Phe 


He 


Arg 


Arg 


Val 


Asp 


Lys 


Asp 


Gin 


198 


1 








5 










10 










15 




200 


Trp 


Gly 


Pro 


Gly 


Val 


Met 


Gly 


Lys 


Asp 


Pro 


Lys 


Pro 


Cys 


Pro 


Ser 


Arg 


201 








20 










25 










30 






203 


Trp 


Ala 


Gly 


Lys 


Cys 


Met 


Gly 


Pro 


Pro 


Ser 


Ser 


Ala 


Ala 


Ala 


Cys 


Ser 


204 






35 










40 










45 








206 


Arg 


Gly 


Ser 


Pro 


Arg 


He 


Leu 


Arg 


Val 


Arg 


Ala 


Gly 


Gly 


He 


Ser 


Leu 


207 




50 










55 










60 










209 


Phe 


Tyr 


Thr 


He 


Met 


Ala 


Val 


Leu 


Leu 


Leu 


Leu 


Leu 


Val 


Val 


Glu 


Ala 


210 


65 










70 










75 










80 


212 


Gly 


Ala 


He 


Leu 


Ala 


Pro 


Ala 


Thr 


His 


Ala 


Cys 


Arg 


Ala 


Asn 


Gly 


Gin 


213 










85 










90 










95 
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